1: AF458589. Reports Homo sapiens myos...[gi:21360805]

LOCUS AF458589 4631 bp mRNA linear PRI 10-JUN-2002
DEFINITION Homo sapiens myosin phosphatase target subunit 1 variant (PPP1R12A)
mRNA, complete cds.
ACCESSION AF458589
VERSION AF458589.1 GI:21360805
KEYWORDS
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4631)
AUTHORS Guo,J.H., Chen,X.Y. and Yu,L.
TITLE Direct Submission
JOURNAL Submitted (15-DEC-2001) School of Life Sciences, Institute of
Genetics, Fudan University, Handan Road, 220, Shanghai 200433,
China
FEATURES Location/Qualifiers
source 1..4631
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/chromosome="12"
/map="12q21.2"
/tissue_type="liver"
gene 1..4631
/gene="PPP1R12A"
/gene_synonym="MYPT1"
CDS 124..3111
/gene="PPP1R12A"
/gene_synonym="MYPT1"
/codon_start=1
/product="myosin phosphatase target subunit 1 variant"
/protein_id="AAM49717.1"
/db_xref="GI:21360806"
/translation="MKMADAKQKRNEQLKRWIGSETDLEPPVVKRQKTKVKFDDGAVF 
LAACSSGDTDEVLKLLHRGADINYANVDGLTALHQACIDDNVDMVKFLVENGANINQP 
DNEGWIPLHAAASCGYLDIAEFLIGQGAHVGAVNSEGDTPLDIAEEEAMEELLQNEVN 
RQGVDIEAARKEEERIMLRDARQWLNSGHINDVRHAKSGGTALHVAAAKGYTEVLKLL 
IQAGYDVNIKDYDGWTPLHAAAHWGKEEACRILVDNLCDMEMVNKVGQTAFDVADEDI 
LGYLEELQKKQNLLHSEKRDKKSPLIESTANMDNNQSQKTFKNKETLIIEPEKNASRI 
ESLEQEKVDEEEEGKKDESSCSSEEDEEDDSESEAETDKTKPLASVTNANTSSTQAAP 
VAVTTPTVSSGQATPTSPIKKFPTTATKISPKEEERKDESPATWRLGLRKTGSYGALA 
EITASKEGQKEKDTAGVTRSASSPRLSSSLDNKEKEKDSKGTRLAYVAPTIPRRLAST 
SDIEEKENRDSSSLRTSSSYTRRKWEDDLKKNSSVNEGSTYHKSCSFGRRQDDLISSS 
VPSTTSTPTVTSAAGLQKSLLSSTSTTTKITTGSSSAGTQSSTSNRLWAEDSTEKEKD 
SVPTAVTIPVAPTVVNAAASTTTLTTTTAGTVSSTTEVRERRRSYLTPVRDEESESQR 
KARSRQARQSRRSTQGVTLTDLQEAEKTIGRSRSTRTREQENEEKEKEEKEKQDKEKQ 
EEKKESETSREDEYKQKYSRTYDETYQRYRPVSTSSSTTPSSSLSTMSSSLYASSQLN 
RPNSLVGITSAYSRGITKENEREGEKREEEKEGEDKSQPKSIRERRRPREKRRSTGVS
FWTQDSDENEQEQQSDTEEGSNKKETQTDSISRYETSSTSAGDRYDSLLGRSGSYSYL 
EERKPYSSRLEKDDSTDFKKRQERFADRSLLEMEKRERRALERRISEMEEELKMLPDL 
KADNQRLKDENGALIRVISKLSK" 

ORIGIN 

1 agatctgccc tgtagagcct tgcggttcca ctgctggcct ccggattccc gggagccccc 
61 agcccgacag gacaaccttc cttccccgct tcctcctttc cagacccggc ggcaggagag 
121 gggatgaaga tggcggacgc gaagcagaag cggaacgagc agctgaaacg ctggatcggc 
181 tccgagacgg acctcgagcc tccggtggtg aagcgccaga agaccaaggt gaagttcgac 
241 gatggcgccg tcttcctggc tgcttgctcc agcggcgaca cggacgaggt cctcaagctg 
301 ctgcaccgcg gcgccgacat caattacgcc aatgtggacg gactcactgc cctgcaccag 
361 gcttgcattg atgacaatgt tgatatggtg aagtttctgg tagaaaatgg agcaaatatt 
421 aatcaacctg ataatgaagg ctggatacca ctacatgcag cagcttcctg tggatatctt 
481 gatattgcag agtttttgat tggtcaagga gcacatgtag gggctgtcaa cagtgaagga 
541 gatacacctt tagatattgc ggaggaggag gcaatggaag agctacttca aaatgaagtt 
601 aatcggcaag gggttgatat agaagcagct cgaaaggaag aagaacggat catgcttaga 
661 gatgccaggc agtggctaaa tagtggtcat ataaatgatg tccggcatgc aaaatctgga 
721 ggtacagcac ttcacgttgc agctgctaaa ggctatacgg aagttttaaa acttttaata 
781 caggcaggct atgatgttaa tattaaagac tatgatggct ggacacctct tcatgctgca 
841 gctcattggg gtaaagaaga agcatgtcga attttagtgg acaatctgtg tgatatggag 
901 atggtcaaca aagtgggcca aacagccttt gatgtagcag atgaagacat tttaggatat 
961 ttagaagagt tgcaaaagaa acaaaatctg ctccatagtg aaaaacggga caagaaatct 
1021 ccactaattg aatcaacagc aaatatggac aataatcagt cacagaagac ctttaaaaac 
1081 aaagagacgt tgattattga accagagaaa aatgcatccc gtattgaatc tctggaacaa 
1141 gaaaaggttg atgaagaaga agaaggaaag aaggatgagt ctagctgctc tagtgaagaa 
1201 gatgaggaag atgactcgga atcagaagct gaaacagata agacaaaacc cctggcttct 
1261 gtaactaatg ccaacacttc tagtacacaa gcagctcctg tagctgttac aacacctact 
1321 gtgtcatcag gtcaagcaac acctacatca cctattaaaa agtttccaac cacagctaca 
1381 aaaatttctc ccaaagaaga agagagaaaa gatgagtctc ctgcaacttg gaggttagga 
1441 cttagaaaga cgggcagcta tggtgcactt gctgaaatca cagcatctaa agagggtcag 
1501 aaagaaaaag atactgcagg tgttacacgt tcagcttcaa gtcccagact ttcctcctct 
1561 ttggataata aagaaaagga gaaagatagt aaaggaacta ggcttgcata tgttgcacct 
1621 acaataccaa gacgactagc cagtacatct gacattgaag agaaagaaaa cagagattct 
1681 tcaagtttgc gaacaagtag ttcatataca aggagaaaat gggaagatga tcttaaaaaa 
1741 aatagctcag ttaatgaagg atcaacgtat cataaaagtt gctcctttgg tagaagacaa 
1801 gatgatttga ttagttctag tgttccaagc accacatcaa caccaacagt tacctctgca 
1861 gctgggcttc agaaaagcct gctttccagc acaagcacta ctacaaagat tacaacgggt 
1921 tcttcctcag caggcacaca aagcagtacc tcaaatcgtt tgtgggctga ggatagtact 
1981 gagaaagaaa aggacagtgt tcctacggca gtgaccattc ctgttgctcc aactgttgta 
2041 aatgctgcag cttctaccac aaccctgact acaactactg ctggcactgt ctcctccaca 
2101 acagaggtca gggagagacg cagatcatac ctcactcctg ttagggatga agagtctgaa 
2161 tcccaaagaa aagcaagatc tagacaagca agacaatcta gaagatcaac acagggagtg 
2221 acattaactg atcttcaaga agctgagaaa acaataggaa gaagtcgttc tacccgaacc 
2281 agagaacaag aaaatgaaga aaaagaaaaa gaggaaaaag agaaacaaga taaagagaaa 
2341 caagaagaaa agaaggagtc agaaacatct agagaagatg aatataaaca aaagtactcc 
2401 agaacgtatg atgagactta ccagcgttat aggccagtat caacttcaag ttcaaccact 
2461 ccatcctctt cactttctac tatgagcagt tcactgtatg cttcaagtca actaaacagg
2521 ccaaatagtc ttgtaggcat aacttctgct tactccagag gaataacaaa agaaaatgaa 
2581 agagagggag aaaaaagaga agaggagaaa gaaggagaag ataaatcaca acctaaatca 
2641 atcagagaac gacgacgacc aagagagaaa agaagatcta caggagtttc attttggaca 
2701 caagatagtg atgaaaatga acaagaacaa caatcagaca cagaagaggg atccaataag
2761 aaagaaactc agacggattc catttctaga tatgaaacca gttctacatc agctggtgat 
2821 cgatatgatt ccttgctggg tcgctctgga tcatacagtt acttagaaga aagaaaacct
2881 tacagcagca ggctagaaaa ggatgactca actgacttta aaaagagaca agaaagattt 
2941 gctgatagat cactgttgga aatggaaaaa agggaacgaa gagctctaga aagaagaata 
3001 tctgaaatgg aagaagagct caaaatgtta ccagacctaa aagcagacaa ccagaggcta 
3061 aaggatgaaa atggggcctt gatcagagtt ataagcaaac tttccaaata aaaaaaaaaa 
3121 agcagcaagt aatggaattg cacatattag taacccagtg gaccataatt ggcagtcact 
3181 ggaagtctgg gaagaatcct tggagactgt cattttcgga tatcctgcca aatgccctct 
3241 tatctagaat ttttgtttca ttttgtttaa ttttctgggg tgtttttgtt gttgttggtt 
3301 tgttttttgt tttttttttt aatcaagacc attgtttcat gttaatgcag ctgctgagaa 

3361 gatttttttt taatgactga gaaaacttgt ttacagctcc agcatataag gaaagtgttc
3421 aaggccagat atgcctcaga tatttaacca gtaagcctta gttgtacata aatacttttg 

3481 tgtcaacaaa aactttcagc tctcacagaa gacagttact caacattttt tgatgtgcca
3541 cagtttcgag tttttcgata tttaaatttt ttggcttttc atctaagttt gggtttgtat 
3601 tttttccttc taaactcttc atgtggcaga gtcttctatg ttttcacggc tttttcatta 
3661 cagaaaagaa cacttgctct tctgtgatta ttgtcatgta ttaggctaat gctgtgttgt
3721 ctcccacctg gaactgaatt gcttggtgga acatatgctt tcactgtttg tgcaatatgc 
3781 atttatttct tatatgaatg ctttaaagtc atttgaggtt agatctttta attcctattt 
3841 tctgcttcat tggtcacttt ttttttattg tagtataaga tgttagattc tgtaatcttc 
3901 acattcattt tagcaggtac tgagtgatgc tgtatataca aataagtgta ttgttttgat 
3961 ttttagacca ccacatggca tgcttgacta tttcttattt caaatgtctg ctaatgcaga 
4021 gtaggctact ccatgatagt gttaaaaaac aaaatttgct aacaatgtga tataaagact 
4081 ttaaaagtta cacattatgt ggagccctat ctttacaaaa gtttcctact gtaaagtgct 
4141 tttattttca gttttcattt gatagtactc aaccataatt aaagttgcat aagataattg 
4201 ctttacattt cacataccta tatttatctg agtgctgtct aaaactgttg tgctagccaa 
4261 agtaatgcta tgaaatcatt tgcagaatta acccgtgagt taatgttaaa tgcactgtta 
4321 ttgccatgtg aagaggcatc gactttgata ccaccatcat gttcagacca ttttatacat 
4381 ttcagtggcc tttttttttt taaggaaaaa aaagcgcaaa accaagtaca tagtgacgat 
4441 ggcttttatt tggacaaata gcttttatat tttcattaaa ccatgcaaaa aatactacat
4501 ctttctggca cataactgtc tccttaacca ctggaacagt tcagccattt gaataaattg 
4561 tacattgtaa agcttatagt agctgattgt attattgatt gtattgtata ctatattaaa
4621 tgtgaatttg t
1: U79283. Reports Human albumin D-b...[gi:1710256]

LOCUS HSU79283 1480 bp mRNA linear PRI 28-NOV-2000
DEFINITION Human albumin D-box binding protein mRNA, complete cds.
ACCESSION U79283
VERSION U79283.1 GI:1710256
KEYWORDS
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1480)
AUTHORS Andersson,B., Wentland,M.A., Ricafrente,J.Y., Liu,W. and Gibbs,R.A.
TITLE A 'double adaptor' method for improved shotgun library construction
JOURNAL Anal. Biochem. 236 (1), 107-113 (1996)
PUBMED 8619474
REFERENCE 2 (bases 1 to 1480)
AUTHORS Yu,W., Andersson,B., Worley,K.C., Muzny,D.M., Ding,Y., Liu,W.,
Ricafrente,J.Y., Wentland,M.A., Lennon,G. and Gibbs,R.A.
TITLE Large-scale concatenation cDNA sequencing
JOURNAL Genome Res. 7 (4), 353-358 (1997)
PUBMED 9110174
REFERENCE 3 (bases 1 to 1480)
AUTHORS Yu,W. and Gibbs,R.A.
TITLE Direct Submission
JOURNAL Submitted (22-NOV-1996) Molecular and Human Genetics, Baylor
College of Medicine, One Baylor Plaza S930, Houston, TX 77030, USA
FEATURES Location/Qualifiers
source 1..1480
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="23601"
/sex="female"
/tissue_type="brain"
/clone_lib="Soares library 1NIB from IMAGE consortium"
/dev_stage="infant"
CDS 339..1316
/note="similar to human albumin D-box binding protein
encoded by GenBank Accession Number U06936"
/codon_start=1
/product="albumin D-box binding protein"
/protein_id="AAB50219.1"
/db_xref="GI:1710257"
/translation="MARPVSDRTPAPLLLGGPAGTPPGGGALLGLRSLLQGTSKPKEP 
ASCLLKEKERKAALPAATTPGPGLETAGPADAPAGAVVGGGSPRGRPGPVPAPGLLAP 
LLWERTLPFGDVEYVDLDAFLLEHGLPPSPPPPGGPSPEPSPARTPAPSPGPGSCGSA 
SPRSSPGHAPARAALGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPADLALSSIP 
GHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYWSRRYKNNEAAKRSRDAR 
RLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL"
ORIGIN
1 gcagattttg caagagccag gctcgcccac cttgtagaag gagcgccttg agtcccctct
61 caccctcggt tgcaaagagc cgaccgcttg atctggacac cccctcgccc agattgcatg 
121 atctcccggg accctcttga gttgcacgtt tctgcaccga ggacctcaaa tccccgtcgc 
181 tcctaggatt tgcagcgttc tggatactgg agggttgcag gctacactcg cccgcccctg 
241 ggcagacact cgtccaaacc actggagtgt gctggtgact ggcaggccag cccttcgcct 
301 ctccatgaac ccgtgagcct gggggcaggt gccaggcgat ggcgcggcct gtgagcgaca 
361 ggaccccggc ccctctgctg ctgggcggcc cggccgggac accccctggc gggggagcgc
421 tgcttgggtt gcggagcctt ctgcagggga ccagcaagcc caaagagccg gccagctgtc 
481 tcctgaagga aaaggagcgc aaggcggccc tgcctgcagc cacaacccct gggccaggcc 
541 tggagactgc gggcccggcg gatgccccgg ctggggcagt ggtgggcgga gggtccccgc 
601 gggggcgccc ggggccggtg cccgccccgg gtctgttggc gccactgctg tgggagcgca 
661 cgctgccgtt cggcgatgtg gagtacgtag acctggacgc cttcctgctg gagcacgggc 
721 tcccgcccag cccgccgccc cccggtggcc cgtcgccgga gccgtcgccc gcgcggacgc 
781 ccgcaccctc cccagggccg ggttcgtgcg gctcggcttc cccccgctcc tctcctgggc 
841 acgcccccgc ccgggctgcc ctcgggaccg ccagcggcca ccgcgcaggc ctgacctctc 
901 gggacacacc cagccctgtg gacccagaca ccgtggaggt gttgatgacc tttgaacccg 
961 acccagctga tcttgcccta tcaagcattc ctggccacga gacctttgac cctcgaagac 
1021 atcgcttctc agaagaggaa cttaagcccc agccaatcat gaagaaggca agaaaaatcc 
1081 aggtgccgga ggagcagaag gatgagaaat actggagccg gcggtacaag aacaacgagg 
1141 cagccaagcg gtcccgtgac gcccggcggc tcaaggagaa ccagatatcg gtgcgggcgg 
1201 ccttcctgga gaaggagaac gccctgctgc ggcaggaagt tgtggccgtg cgccaggagc 
1261 tgtcccacta ccgcgccgtg ctgtcccgat accaggccca gcacggggcc ctgtgaggct 
1321 gccccacatc cccacctggc ggagctctcc tccgccttgc tgagacttac gccctgttcc 
1381 cttcctgccc tgtggcccac gggccggcca gctgggtgcc ccagggacgt gataatgcag 
1441 ataaatacat ttatattttt aaaaaaaaaa aaaaaaaaaa 
1: BC063851. Reports Homo sapiens comp...[gi:39645842]

LOCUS BC063851 4015 bp mRNA linear PRI 15-JUL-2006
DEFINITION Homo sapiens complement component 7, mRNA (cDNA clone MGC:75123
IMAGE:6184213), complete cds.
ACCESSION BC063851
VERSION BC063851.1 GI:39645842
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 

REFERENCE 1 (bases 1 to 4015)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 4015)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (08-DEC-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Dr. James R. Lupski
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Genome Sequence Centre,
BC Cancer Agency, Vancouver, BC, Canada
info@bcgsc.bc.ca
Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
Jacquie Schein, Asim Siddiqui, Steven Jones, Rob Holt, Marco Marra.

Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 141 Row: d Column: 14

This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 45580687.

Differences found between this sequence and the human reference
genome (build 36) are described in misc_difference features below.
FEATURES Location/Qualifiers 
source 1..4015 

/organism="Homo sapiens" 
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:75123 IMAGE:6184213"
/tissue_type="Peripheral Nervous System, dorsal root
ganglion"
/clone_lib="Lupski_dorsal_root_ganglion"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..4015
/gene="C7"
/db_xref="GeneID:730"
/db_xref="HGNC:1346"
/db_xref="MIM:217070"
CDS 96..2627
/gene="C7"
/codon_start=1
/product="complement component 7"
/protein_id="AAH63851.1"
/db_xref="GI:39645843"
/db_xref="GeneID:730"
/db_xref="HGNC:1346"
/db_xref="MIM:217070"

/translation="MKVISLFILVGFIGEFQSFSSASSPVNCQWDFYAPWSECNGCTK 
TQTRRRSVAVYGQYGGQPCVGNAFETQSCEPTRGCPTEEGCGERFRCFSGQCISKSLV 
CNGDSDCDEDSADEDRCEDSERRPSCDIDKPPPNIELTGNGYNELTGQFRNRVINTKS 
FGGQCRKVFSGDGKDFYRLSGNVLSYTFQVKINNDFNYEFYNSTWSYVKHTSTEHTSS 
SRKRSFFRSSSSSSRSYTSHTNEIHKGKSYQLLVVENTVEVAQFINNNPEFLQLAEPF
WKELSHLPSLYDYSAYRRLIDQYGTHYLQSGSLGGEYRVLFYVDSEKLKQNDFNSVEE 
KKCKSSGWHFVVKFSSHGCKELENALKAASGTQNNVLRGEPFIRGGGAGFISGLSYLE 
LDNPAGNKRRYSAWAESVTNLPQVIKQKLTPLYELVKEVPCASVKKLYLKWALEEYLD 
EFDPCHCRPCQNGGLATVEGTHCLCHCKPYTFGAACEQGVLVGNQAGGVDGGWSCWSS 
WSPCVQGKKTRSRECNNPPPSGGGRSCVGETTESTQCEDEELEHLRLLEPHCFPLSLV 
PTEFCPSPPALKDGFVQDEGTMFPVGKNVVYTCNEGYSLIGNPVARCGEDLRWLVGEM 
HCQKIACVLPVLMDGIQSHPQKPFYTVGEKVTVSCSGGMSLEGPSAFLCGSSLKWSPE 
MKNARCVQKENPLTQAVPKCQRWEKLQNSRCVCKMPYECGPSLDVCAQDERSKRILPL 
TVCKMHVLHCQGRNYTLTGRDSCTLPASAEKACGACPLWGKCDAESSKCVCREASECE 
EEGFSICVEVNGKEQTMSECEAGALRCRGQSISVTSIRPCAAETQ"
misc_difference 3994..4015
/gene="C7"
/note="polyA tail: 22 bases do not align to the human
genome."

ORIGIN 

1 ggcagcctgc tgggctcttc ctgctgttga aaacttaccc ggcccttaca gaggaaatct 
61 tcctcctctc ttctgccctg aatgttttcc caaacatgaa ggtgataagc ttattcattt 
121 tggtgggatt tataggagag ttccaaagtt tttcaagtgc ctcctctcca gtcaactgcc 
181 agtgggactt ctatgcccct tggtcagaat gcaatggctg taccaagact cagactcgca 
241 ggcggtcagt tgctgtgtat gggcagtatg gaggccagcc ttgtgttgga aatgcttttg 
301 aaacacagtc ctgtgaacct acaagaggat gtccaacaga ggagggatgt ggagagcgtt 
361 tcaggtgctt ttcaggtcag tgcatcagca aatcattggt ttgcaatggg gattctgact 
421 gtgatgaaga cagtgctgat gaagacagat gtgaggactc agaaaggaga ccttcctgtg 
481 atatcgataa acctcctcct aacatagaac ttactggaaa tggttacaat gaactcactg 
541 gccagtttag gaacagagtc atcaatacca aaagttttgg tggtcaatgt agaaaggtgt 
601 ttagtgggga tggaaaagat ttctacaggc tgagtggaaa tgtcctgtcc tatacattcc 
661 aggtgaaaat aaataatgat tttaattatg aattttacaa tagtacttgg tcttatgtaa 
721 aacatacgtc gacagaacac acatcatcta gtcggaagcg ctcctttttt agatcttcat
781 catcttcttc acgcagttat acttcacata ccaatgaaat ccataaagga aagagttacc 
841 aactgctggt tgttgagaac actgttgaag tggctcagtt cattaataac aatccagaat 
901 ttttacaact tgctgagcca ttctggaagg agctttccca cctcccctct ctgtatgact 
961 acagtgccta ccgaagatta atcgaccagt acgggacaca ttatctgcaa tctgggtcgt 
1021 taggaggaga atacagagtt ctattttatg tggactcaga aaaattaaaa caaaatgatt 
1081 ttaattcagt cgaagaaaag aaatgtaaat cctcaggttg gcattttgtc gttaaatttt 
1141 caagtcatgg atgcaaggaa ctggaaaacg ctttaaaagc tgcttcagga acccagaaca 
1201 atgtattgcg aggagaaccg ttcatcagag ggggaggtgc aggcttcata tctggcctta 
1261 gttacctaga gctggacaat cctgctggaa acaaaaggcg atattctgcc tgggcagaat 
1321 ctgtgactaa tcttcctcaa gtcataaaac aaaagctgac acctttatat gagctggtaa 
1381 aggaagtacc ttgtgcctct gtgaaaaaac tatacctgaa atgggctctt gaagagtatc 
1441 tggatgaatt tgacccctgt cattgccggc cttgtcaaaa tggtggtttg gctactgttg 
1501 aggggaccca ttgtctgtgc cattgcaaac cgtacacatt tggtgcggcg tgtgagcaag 
1561 gagtcctcgt agggaatcaa gcaggagggg ttgatggagg ttggagttgc tggtcctctt 
1621 ggagcccctg tgtccaaggg aagaaaacaa gaagccgtga atgcaataac ccacctccca 
1681 gtgggggtgg gagatcctgc gttggagaaa cgacagaaag cacacaatgc gaagatgagg 
1741 agctggagca cttgaggttg cttgaaccac attgctttcc tttgtctttg gttccaacag 
1801 aattctgtcc atcacctcct gccttgaaag atggatttgt tcaagatgaa ggtacaatgt 
1861 ttcctgtggg gaaaaatgta gtgtacactt gcaatgaagg atactctctt attggaaacc 
1921 cagtggccag atgtggagaa gatttacggt ggcttgttgg ggaaatgcat tgtcagaaaa 
1981 ttgcctgtgt tctacctgta ctgatggatg gcatacagag tcacccccaa aaacctttct 
2041 acacagttgg tgagaaggtg actgtttcct gttcaggtgg catgtcctta gaaggtcctt 
2101 cagcatttct ctgtggctcc agccttaagt ggagtcctga gatgaagaat gcccgctgtg 
2161 tacaaaaaga aaatccgtta acacaggcag tgcctaaatg tcagcgctgg gagaaactgc 
2221 agaattcaag atgtgtttgt aaaatgccct acgaatgtgg accttccttg gatgtatgtg 
2281 ctcaagatga gagaagcaaa aggatactgc ctctgacagt ttgcaagatg catgttctcc 
2341 actgtcaggg tagaaattac acccttactg gtagggacag ctgtactctg cctgcctcag 
2401 ctgagaaagc ttgtggtgcc tgcccactgt ggggaaaatg tgatgctgag agcagcaaat 
2461 gtgtctgccg agaagcatcg gagtgcgagg aagaagggtt tagcatttgt gtggaagtga 
2521 acggcaagga gcagacgatg tctgagtgtg aggcgggcgc tctgagatgc agagggcaga 
2581 gcatctctgt caccagcata aggccttgtg ctgcggaaac ccagtaggct cctggaggcc 
2641 ctggtcagct tgcttggaat ccagcaggca gctggggctg agtgaaaaca tctgcacaac 
2701 tgggcactgg acagcttttc cttcttctcc agtgtctacc ttcctcctca actcccagcc 
2761 atctgtataa acacaatcct ttgttctccc aaatctgaat cgaattactc ttttgcctcc
2821 tttttaatgt cagtaaggat atgagccttt gcacaggctg gctgcgtgtt cttgaaatag 
2881 gtgttacctt ctctgggcct tggtttttta aaatctgtaa aattagagga ttgcactaga 
2941 gaaacttgaa tgctccattc aggcctatca ttttattaag tatgattgac acagcccatg 
3001 ggccagaaca cactctacaa aatgactagg ataacagaaa gaacgtgatc tcctgattag 
3061 agagggtggt tttcctcaat ggaaccaaat ataaagagga cttgaacaaa aatgacagat 
3121 acaaactatt tctatcctga gtagtaatct cacacttcat cctatagagt caaccaccac 
3181 agataggaat tccttattct ttttttaatt tttttaagac agagtctcac tttgttgccc 
3241 aggctggagc gcagtggggt gatctcatct ccctgcaacc tccgcctcct gggttcaagc 
3301 gattcttgtg cctcagcttc ccaagcagct gggattacag gtgcccgcca ccacgcccag 
3361 ctaatttttg catttttagt agagatgggg tttcaccatg ttggccacgc tcgtctccaa 
3421 ctcctgacct caggtaatcc gcctgccttg gcctcccaaa gtgctgggat tacagacatg 
3481 aaccaccacg cctggctgga atacttactc ttgtcgggag attgaaccac taaaatgtta 
3541 gagcagaatt cattatgctg tggtcacagg ggtgtcttgt ctgagaacaa atacaattca 
3601 gtcttctctt tggggtttta gtatgtgtca aacataggac tggaagtttg cccctgttct 
3661 tttttctttt gaaagaacat cagttcatgc ctgaggcatg agtgactgtg catttgagaa 
3721 tagttttccc tattctgtgg atacagtccc agagttttca gggagtacac aggtagatta 
3781 gtttgaagca ttgacctttt atttattcct tatttctctt tcatcaaaac aaaacagcag 
3841 ctgtgggagg agaaatgaga gggcttaaat gaaatttaaa ataagctata ttatacaaat 
3901 actatctctg tattgttctg accctggtaa atatatttca aaacttcaga tgacaaggat 
3961 tagaacactc attaaagatg ctattcttca gaaaaaaaaa aaaaaaaaaa aaaaa 
1: BC002788. Reports Homo sapiens plas...[gi:33877182]

LOCUS BC002788 1353 bp mRNA linear PRI 15-JUL-2006
DEFINITION Homo sapiens plasminogen activator, urokinase receptor, mRNA (cDNA
clone MGC:3905 IMAGE:3617894), complete cds.
ACCESSION BC002788
VERSION BC002788.2 GI:33877182
KEYWORDS MGC.

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 

REFERENCE 1 (bases 1 to 1353)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1353)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (05-FEB-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 19, 2003 this sequence version replaced gi:12803884.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC/DCTD/DTP
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov

Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.

Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 12 Row: j Column: 13

This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 53829377.

Differences found between this sequence and the human reference
genome (build 36) are described in misc_difference features below.
FEATURES Location/Qualifiers 
source 1..1353
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:3905 IMAGE:3617894"
/tissue_type="Skin, melanotic melanoma."
/clone_lib="NIH_MGC_20"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..1353
/gene="PLAUR"
/gene_synonym="UPAR"
/gene_synonym="URKR"
/gene_synonym="CD87"
/db_xref="GeneID:5329"
/db_xref="HGNC:9053"
/db_xref="MIM:173391"
CDS 66..1073
/gene="PLAUR"
/gene_synonym="UPAR"
/gene_synonym="URKR"
/gene_synonym="CD87"
/codon_start=1
/product="plasminogen activator, urokinase receptor"
/protein_id="AAH02788.1"
/db_xref="GI:12803885"
/db_xref="GeneID:5329"
/db_xref="HGNC:9053"
/db_xref="MIM:173391"

/translation="MGHPPLLPLLLLLHTCVPASWGLRCMQCKTNGDCRVEECALGQD 
LCRTTIVRLWEEGEELELVEKSCTHSEKTNRTLSYRTGLKITSLTEVVCGLDLCNQGN 
SGRAVTYSRSRYLECISCGSSDMSCERGRHQSLQCRSPEEQCLDVVTHWIQEGEEGRP 
KDDRHLRGCGYLPGCPGSNGFHNNDTFHFLKCCNTTKCNEGPILELENLPQNGRQCYS 
CKGNSTHGCSSEETFLIDCRGPMNQCLVATGTHEPKNQSYMVRGCATASMCQHAHLGD 
AFSMNHIDVSCCTKSGCNHPDLDVQYRSGAAPQPGPAHLSLTITLLMTARLWGGTLLW 
T" 

misc_difference 1336..1353
/gene="PLAUR"
/gene_synonym="UPAR"
/gene_synonym="URKR"
/gene_synonym="CD87"
/note="polyA tail: 18 bases do not align to the human
genome."

ORIGIN 

1 ccttcctgag gccagaagga gagaagacgt gcagggaccc cgcgcacagg agctgccctc 
61 gcgacatggg tcacccgccg ctgctgccgc tgctgctgct gctccacacc tgcgtcccag 
121 cctcttgggg cctgcggtgc atgcagtgta agaccaacgg ggattgccgt gtggaagagt 
181 gcgccctggg acaggacctc tgcaggacca cgatcgtgcg cttgtgggaa gaaggagaag 
241 agctggagct ggtggagaaa agctgtaccc actcagagaa gaccaacagg accctgagct 
301 atcggactgg cttgaagatc accagcctta ccgaggttgt gtgtgggtta gacttgtgca 
361 accagggcaa ctctggccgg gctgtcacct attcccgaag ccgttacctc gaatgcattt 
421 cctgtggctc atcagacatg agctgtgaga ggggccggca ccagagcctg cagtgccgca 
481 gccctgaaga acagtgcctg gatgtggtga cccactggat ccaggaaggt gaagaagggc 
541 gtccaaagga tgaccgccac ctccgtggct gtggctacct tcccggctgc ccgggctcca 
601 atggtttcca caacaacgac accttccact tcctgaaatg ctgcaacacc accaaatgca

661 acgagggccc aatcctggag cttgaaaatc tgccgcagaa tggccgccag tgttacagct 

721 gcaaggggaa cagcacccat ggatgctcct ctgaagagac tttcctcatt gactgccgag 

781 gccccatgaa tcaatgtctg gtagccaccg gcactcacga accgaaaaac caaagctata 

841 tggtaagagg ctgtgcaacc gcctcaatgt gccaacatgc ccacctgggt gacgccttca 

901 gcatgaacca cattgatgtc tcctgctgta ctaaaagtgg ctgtaaccac ccagacctgg 

961 atgtccagta ccgcagtggg gctgctcctc agcctggccc tgcccatctc agcctcacca 

1021 tcaccctgct aatgactgcc agactgtggg gaggcactct cctctggacc taaacctgaa 

1081 atccccctct ctgccctggc tggatccggg ggaccccttt gcccttccct cggctcccag 

1141 ccctacagac ttgctgtgtg acctcaggcc agtgtgccga cctctctggg cctcagtttt 

1201 cccagctatg aaaacagcta tctcacaaag ttgtgtgaag cagaagagaa aagctggagg 

1261 aaggccgtgg gccaatggga gagctcttgt tattattaat attgttgccg ctgttgtgtt 

1321 gttgttatta attaaaaaaa aaaaaaaaaa aaa 
BC005235. Reports Homo sapiens nucl...[gi:13528872]

LOCUS BC005235 1199 bp mRNA linear PRI 15-JUL-2006
DEFINITION Homo sapiens nuclear DNA-binding protein, mRNA (cDNA clone
MGC:12261 IMAGE:3930648), complete cds.
ACCESSION BC005235
VERSION BC005235.1 GI:13528872

KEYWORDS MGC. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 

REFERENCE 1 (bases 1 to 1199)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1199)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (27-MAR-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: CLONTECH Laboratories, Inc.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 16 Row: 1 Column: 7

This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 27894372.

Differences found between this sequence and the human reference
genome (build 36) are described in misc_difference features below
and these differences were also compared to chimpanzee genome
(build 2).

FEATURES Location/Qualifiers
source 1..1199
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:12261 IMAGE:3930648"
/tissue_type="Lung, mucoepidermoid carcinoma"
/clone_lib="NIH_MGC_59"
/lab_host="DH10B"
/note="Vector: pDNR-LIB"
gene 1..1199
/gene="C1D"
/gene_synonym="MGC12261"
/gene_synonym="MGC14659"
/gene_synonym="SUNCOR"
/db_xref="GeneID:10438"
/db_xref="MIM:606997"
CDS 64..489
/gene="C1D"
/gene_synonym="MGC12261"
/gene_synonym="MGC14659"
/gene_synonym="SUNCOR"
/codon_start=1
/product="nuclear DNA-binding protein"
/protein_id="AAH05235.1"
/db_xref="GI:13528873"
/db_xref="GeneID:10438"
/db_xref="MIM:606997"

/translation="MAGEEINEDYPVEIHEYLSAFENSIGAVDEMLKTMMSVSRNELL 
QKLDPLEQAKVDLVSAYTLNSMFWVYLATQGVNPKEHPVKQELERIRVYMNRVKEITD 
KKKAGKLDRGAASRFVKNALWEPKSKNASKVANKGKSKS" 
misc_difference 480
/gene="C1D"
/gene_synonym="MGC12261"
/gene_synonym="MGC14659"
/gene_synonym="SUNCOR"
/note="'C' in cDNA is 'T' in the human genome; no amino
acid change. The chimpanzee genome agrees with the human
genomic sequence and not the cDNA."
misc_difference 548^549
/gene="C1D"
/gene_synonym="MGC12261"
/gene_synonym="MGC14659"
/gene_synonym="SUNCOR"
/note="1 base in the human genome, C, is not found in
cDNA. The chimpanzee genome agrees with the cDNA
sequence, suggesting that this difference is unlikely to
be due to an artifact."
misc_difference 892
/gene="C1D"
/gene_synonym="MGC12261"
/gene_synonym="MGC14659"
/gene_synonym="SUNCOR"
/note="'C' in cDNA is 'T' in the human genome. The
chimpanzee genome agrees with the human genomic sequence
and not the cDNA."
misc_difference 1169..1199
/gene="C1D"
/gene_synonym="MGC12261"
/gene_synonym="MGC14659"
/gene_synonym="SUNCOR"
/note="polyA tail: 31 bases do not align to the human
genome."

ORIGIN 

1 gcccgaccgc tttccgggag actggagtcg aaggccgtga ggtatttttc taagtcagcc 
61 ataatggcag gtgaagaaat taatgaagac tatccagtag aaattcacga gtatttgtca 
121 gcgtttgaga attccattgg tgctgtggat gagatgctga agaccatgat gtctgtttct 
181 agaaatgagt tgttgcagaa gttggatcca cttgaacaag caaaagtgga tttggtttct 
241 gcatacacat taaattcaat gttttgggtt tatttggcaa cccaaggagt taatcctaag 
301 gaacatccag taaaacagga attggaaaga atcagagtat atatgaacag agtcaaggaa 
361 ataacagaca agaaaaaggc tggcaagctg gacagaggtg cagcttcaag atttgtaaaa 
421 aatgccctct gggaaccaaa atcgaaaaat gcatcaaaag ttgccaataa aggaaaaagc 

-4-8-1— aaaagt-ta-aeH^ttfefeggfetefe- tega^-fea-sa-G-a-tea-t-tGaaa-a— ag4^acatctt ccccc c cccc ; 

541 ccccccccgc aaaataattc tgtggcaggg caaggtttaa atgtgtttct tattaatatg 
601 taaattcaca gtaaatatgt aaagctaaat actttcctct ccaaagatca ttatctttat 
661 tgattagcac tgaggatttt aacattgtga tatattatat atttataatt taccatctct 
721 tgatgagact cttatttctt tatataggtc agtcttgcaa gtaccatttt ataagcagct 
781 gtgaaattta agtgaaatgt tctttgtaaa catttgtact attttaaatg aataatgacc 
841 ttatgaagta tgctatctgt aggctgaaat tataggtaca tctgttttca ccatatgata 
901 ttaagaaagc gtgaaatgac ttaaatgttc atttttttct gtatagatac tttatcatgt 
961 tttcatgatt ttaggaatta ctgctttgtt gatattcaaa gtgtgaaact aaaactttat 
1021 ggttgtactt taattcttgg catgttgcct ctatgtccca tttaaaataa aatacattct 
1081 cattaacttt agatgggaaa taaggttgta tgttgatgga tgaattttgg catgatgact 
1141 gtactctcaa taaaggctga aaatgttgca aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 
1: AF060181. Reports Homo sapiens zinc...[gi:3228539]



LOCUS AF060181 4440 bp mRNA linear PRI 16-JUN-1998
DEFINITION Homo sapiens zinc finger protein (ZNF198) mRNA, complete cds.
ACCESSION AF060181
VERSION AF060181.1 GI:3228539
KEYWORDS .
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4440)
AUTHORS Still,I.H. and Cowell,J.K.
TITLE The t(8;13) atypical myeloproliferative disorder: further analysis
of the ZNF198 gene and lack of evidence for multiple genes
disrupted on chromosome 13
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 4440)
AUTHORS Still,I.H.
TITLE Direct Submission
JOURNAL Submitted (16-APR-1998) Neurosciences, Cleveland Clinic Foundation,
9500 Euclid Avenue, Cleveland, OH 44195, USA

FEATURES Location/Qualifiers
source 1..4440
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/chromosome="13"
/map="13q11-12"
gene 1..4440
/gene="ZNF198"
CDS 341..3682
/gene="ZNF198"
/codon_start=1
/product="zinc finger protein"
/protein_id="AAC23591.1"
/db_xref="GI:3228540"

/translation="MWQETFFRMENLQLIIILILGSPSQLHFPVIRNNQGWTLYHQWP 
HFLNRFSSRLSNSSLLNQLKFTCANCKKPLQKGQTAYQRKGSAHLFCSTTCLSSFSHK 
PAPKKLCVMCKKDITTMKGTIVAQVDSSESFQEFCSTSCLSLYEDKQNPTKGALNKSR 
CTICGKLTEIRHEVSFKNMTHKLCSDHCFNRYRMANGLIMNCCEQCGEYLPSKGAGNN 
VLVIDGQQKRFCCQSCVSEYKQVGSHPSFLKEVRDHMQDSFLMQPEKYGKLTTCTGCR 
TQCRFFDMTQCIGPNGYMEPYCSTACMNSHKTKYAKSQSLGIICHFCKRNSLPQYQAT 
MPDGKLYNFCNSSCVAKFQALSMQSSPNGQFVAPSDIQLKCNYCKNSFCSKPEILEWE 
NKVHQFCSKTCSDDYKKLHCIVTYCEYCQEEKTLHETVNFSGVKRPFCSEGCKLLYKQ 
DFARRLGLRCVTCNYCSQLCKKGATKELDGVVRDFCSEDCCKKFQDWYYKAARCDCCK 
SQGTLKERVQWRGEMKHFCDQHCLLRFYCQQNEPNMTTQKGPENLHYDQGCQTSRTKM 
TGSAPPPSPTPNKEMKNKAVLCKPLTMTKATYCKPHMQTKSCQTDDTWRTEYVPVPIP 
VPVYIPVPMHMYSQNIPVPTTVPVPVPVPVFLPAPLDSSEKIPAAIEELKSKVSSDAL
DTELLTMTDMMSEDEGKTETTNINSVIIETDIIGSDLLKNSDPETQSSMPDVPYEPDL
DIEIDFPRAAEELDMENEFLLPPVFGEEYEEQPRPRSKKKGAKRKAVSGYQSHDDSSD
NSECSFPFKYTYGVNAWKHWVKTRQLDEDLLVLDELKSSKSVKLKEDLLSHTTAELNY 
GLAHFVNEIRRPNGENYAPDSIYYLCLGIQEYLCGSNRKDNIFIDPGYQTFEQELNKI 
LRSWQPSILPDGSIFSRVEEDYLWRIKQLGSHSPVALLNTLFYFNTKYFGLKTVEQHL 
RLSFGTVFRHWKKNPLTMENKACLRYQVSSLCGTDNEDKITTGKRKHEDDEPVFEQIE 
NTANPSRCPVKMFECYLSKSPQNLNQRMDVFYLQPECSSSTDSPVWYTSTSLDRNTLE 
NMLVRVLLVKDIYDKDNYELDEDTD" 

ORIGIN 

1 gatgtggatt tctccacttc cagtttttca agaagtaagg taaatgcagg aatgggtaat 
61 agtggtatca ccacagaacc agactctgaa attcagattg ctaatgttac aactttagaa 

121 acaggtgtaa gctctgtgaa tgatggccaa ttagaaaata ctgacgggcg agatatgaac

181 ttaatgatta cacatgtaac atcactgcag aataccaact tgggagatgt ctctaacgga 
241 ctgcagtcaa gtaattttgg tgttaatata caaacataca ccccatcttt aacttcacag 
301 accaagactg gagtaggacc ttttaatcct ggtagaatga atgtggcagg agacgttttt 
361 cagaatggag aatctgcaac tcatcataat cctgattctt ggatctccca gtcagcttca 
421 tttccccgta atcagaaaca accaggggtg gactctttat caccagtggc ctcacttcct 
481 aaacagattt tccagccgtc tgtccaacag cagcctacta aaccagttaa agttcacttg 
541 tgcaaactgc aaaaaacctt tacagaaggg ccagacagct tatcaacgaa aaggatcagc 
601 tcacctcttt tgttctacca cctgcctttc ttccttctcc cacaagcctg ctccaaagaa 
661 actctgtgtt atgtgtaaaa aagatataac tacaatgaaa ggaaccattg ttgctcaagt 
721 ggattcaagt gagtccttcc aggaattctg tagtacatct tgtttatctc tctatgaaga 
781 caaacagaat cctactaaag gagctctaaa taaatcaaga tgtacaatct gtggtaaact 
841 aactgagatt cgccatgaag tcagctttaa aaatatgact cataagctgt gcagtgacca 
901 ctgctttaat agatatagaa tggccaatgg tttaataatg aattgctgtg aacagtgtgg 
961 agagtacttg cccagtaaag gtgctggaaa taatgttctg gtgattgatg gtcaacagaa 
1021 aagattttgc tgtcaaagtt gtgtcagtga atacaaacag gtaggtagcc atccaagctt 
1081 cctgaaggag gttcgagatc acatgcagga ctcttttttg atgcagcctg agaaatatgg 
1141 aaaactgaca acttgtactg gttgccgaac acagtgcagg ttttttgata tgactcagtg 
1201 tataggtcct aatggatata tggagccata ttgttcaact gcttgtatga acagtcacaa 
1261 gacaaaatat gcaaaatcac aaagtttggg aattatttgc catttttgta agcgaaactc 
1321 tttacctcaa taccaagcca caatgcctga tggaaaactg tacaactttt gcaattccag 
1381 ttgtgtggct aaatttcagg ctctaagtat gcagtcatct ccaaatggcc agtttgtagc 
1441 gccaagtgat attcagttga aatgcaacta ctgcaaaaat tccttttgtt caaaaccaga 
1501 aatcctggaa tgggagaaca aagtgcatca gttctgcagc aaaacttgtt cagatgacta 
1561 taagaagttg cattgcatag ttacatattg cgaatactgt caagaggaga agactcttca 
1621 tgaaacagta aatttctctg gcgttaagag acctttctgt agtgaaggct gcaaattatt 
1681 atacaaacag gattttgcca gacgtttagg attgagatgt gttacttgca actattgttc 
1741 tcagctatgt aagaagggag caactaaaga actcgatggt gttgtgagag atttctgcag 
1801 tgaagattgc tgtaaaaaat ttcaggattg gtactacaag gctgcaaggt gtgactgttg 
1861 taaatctcaa ggaactctta aagagcgagt tcagtggcgt ggggaaatga aacatttctg 
1921 tgatcaacat tgcttactgc gtttctactg tcaacaaaat gagcccaaca tgacaactca 
1981 gaaaggacct gaaaacttac attatgatca gggttgtcag acatctcgaa ccaaaatgac 
2041 aggttcagca ccaccccctt ctccaacacc taacaaagag atgaagaaca aagcagttct 
2101 ttgcaaacct ttaacaatga caaaagctac ttactgtaaa cctcacatgc agaccaaatc 
2161 ttgtcagaca gatgatactt ggaggacaga atatgttcca gtgcctatcc ctgtgcctgt 
2221 gtatatccca gttcctatgc acatgtacag tcagaatatt cctgttccta ctacagttcc 
2281 tgttcctgtg ccagttcctg tttttctgcc tgctccattg gacagcagtg agaagattcc 
2341 tgcagcaatt gaggagctaa aaagcaaggt ttcttcagat gctcttgata cagagttgct 
2401 tacaatgacg gatatgatga gtgaagacga ggggaaaaca gagacaacca acatcaacag
2461 tgtaattatt gaaacagata taattggttc agaccttttg aagaactctg acccagagac
2521 acagtccagc atgcctgatg taccatatga accagatttg gatatcgaaa tagattttcc 
2581 cagagctgct gaggagcttg atatggaaaa tgaattttta ttaccacctg tttttggcga 
2641 agaatatgag gaacagccca gacctcgatc taaaaaaaag ggagccaaga gaaaggctgt 
2701 atcaggatac cagtctcatg atgatagttc tgacaattca gaatgcagct ttcctttcaa 
2761 atatacgtat ggcgtaaatg catggaaaca ctgggtcaaa actaggcaac ttgatgaaga 
2821 tcttctggta ttagatgagt taaaatcttc taaatcagta aagttaaaag aggatctact 
2881 ctctcacacc acagctgagc ttaactatgg gttagctcat tttgtcaatg agatccgacg 
2941 gccaaatgga gagaattatg cacctgacag catctattac ctttgccttg gaatacagga 
3001 gtatttgtgt ggaagtaatc gaaaagacaa catatttatt gatcctggat accaaacatt 
3061 tgagcaagaa ttgaataaaa tactgcgaag ctggcaacca agcatacttc cagatgggtc 
3121 aatattctct cgagttgaag aagactatct ctggaggata aaacaactag gatcacactc 
3181 tccagtagct cttctgaata cactgttcta ctttaacact aagtattttg gcctgaaaac 
3241 agtggaacaa cacttaagac tttcctttgg cactgtgttt aggcattgga aaaaaaatcc 
3301 tttaacgatg gaaaacaaag cgtgtcttcg ataccaagtg tcttccttgt gtggaacaga 
3361 taatgaagat aaaattacta ctggaaaaag aaaacatgaa gatgatgagc cagtatttga
3421 acaaattgaa aacacagcca atccttccag atgtcctgtg aaaatgtttg aatgctactt 
3481 gtctaaaagt ccacagaatc ttaatcagag gatggatgtt ttttatttgc aaccagaatg
3541 ctctagttct acagatagcc ctgtctggta tacgtctact tcactggacc gaaacacctt 
3601 ggaaaatatg cttgtacggg ttcttctagt aaaagatatt tatgataaag acaattatga 
3661 actggatgaa gacacagact aaaaaggaac gttgcagaag caatcgggat aaaacagcat 
3721 tagatagtca tgctgctaga tctttattat ggaaaacatt tcaagtttac tccttctgtt 
3781 ttgagttttg tagcagtgta cccacgctgg gtattaccat gtaaataatc tgtgagtgaa 
3841 agttgccatt attctatgta gtggttttag gatacttaac aaatacattc aaattctttt 
3901 tttattatta tttatttgat taggtatgtt tgtaactttt tacattacag aatatgaatg 
3961 agaatgtgcc atgtataatt tttttcttgt agtaagaaac atccatattg cacaactcta 
4021 ctgttgcaaa gcttccttgg aagggggctc ttttactggg ttcttaacca gatggttgtg 
4081 tatgggtagi- ^p)- qrfaaaa gtt-t-agaart f gr-.agtg tr.t ttcg gaattt ttaaaataaa 
4141 ctgtaaacta ataggctggg gtttttgttt tgttttgggg ttttgttttg tttggtttta 
4201 cattttagtt actgaagcct tacaaggtta tgtagagaga taccatcttc tgtaccaaaa 
4261 atagacaaga gaatgctgtc aatattggtg tactgtaatg tgaatctatg ctggtgaaaa 
4321 caattttttg ttccccttat taaaacctta gtgtcctttt ctcatttgtg gctttctgca 
4381 tcacccaatc aataaaaaac aaatatatat atgtatataa cctccttcag gaagcttgga 
1: AF237631. Reports Homo sapiens ubiq...[gi:7288860]



LOCUS AF237631 2072 bp mRNA linear PRI 23-MAR-2000
DEFINITION Homo sapiens ubiquitous tropomodulin U-Tmod mRNA, complete cds.
ACCESSION AF237631
VERSION AF237631.1 GI:7288860
KEYWORDS .
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2072)
AUTHORS Conley,C.A., Almenar-Queralt,A. and Fowler,V.M.
TITLE Identifying novel tropomodulin isoforms
JOURNAL Mol. Biol. Cell 9, 18A (1998)
REFERENCE 2 (bases 1 to 2072)
AUTHORS Conley,C.A. and Fowler,V.M.
TITLE Direct Submission
JOURNAL Submitted (23-FEB-2000) Space Life Sciences, NASA Ames Research
Center, M/S 239-11, Moffett Field, CA 94035-1000, USA
FEATURES Location/Qualifiers
source 1..2072
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
CDS 66..1124
/note="actin filament pointed-end capping protein"
/codon_start=1
/product="ubiquitous tropomodulin U-Tmod"
/protein_id="AAF45299.1"
/db_xref="GI:7288861"

/translation="MALPFRKDLEKYKDLDEDELLGNLSETELKQLETVLDDLDPENA 
LLPAGFRQKNQTSKSTTGPFDREHLLSYLEKEALEHKDREDYVPYTGEKKGKIFIPKQ 
KPVQTFTEEKVSLDPELEEALTSASDTELCDLAAILGMHNLITNTKFCNIMGSSNGVD 
QEHFSNVVKGEKILPVFDEPPNPTNVEESLKRTKENDAHLVEVNLNNIKNIPIPTLKD 
FAKALETNTHVKCFSLAATRSNDPVATAFAEMLKVNKTLKSLNVESNFITGVGILALI
DALRDNETLAELKIDNQRQQLGTAVELEMAKMLEENTNILKFGYQFTQQGPRTRAANA 
ITKNNDLVRKRRVEGDHQ" 

ORIGIN 

1 ccaagattcg gcacgaggga ctgtactgaa cagcaaaaat taagtgactt gctgccctgc 
61 acatcatggc actgccattc cgtaaggact tagaaaagta caaagacctt gatgaagatg 
121 agctccttgg gaatctgtca gaaacagaac tgaaacaact ggaaactgtt ttggatgatc 
181 ttgaccccga gaatgccctt ctgcctgcag ggttccggca gaagaaccag acatcaaagt 
241 ccaccacagg gccatttgat agagagcatc tcctttcata tctggagaaa gaagcattgg 
301 agcataaaga cagggaagac tatgtgccct acactggaga aaaaaaaggg aaaatattta 

361 tccccaaaca gaaacctgta cagactttta cagaagaaaa agtgtctctt gatccagaat
421 tagaagaagc tttgacaagt gcttctgata cagaattgtg tgacctcgca gcaattcttg 

481 ggatgcacaa tttgataacg aatacaaagt tctgtaatat aatgggaagt agtaatggtg
541 ttgaccaaga acatttttca aatgtggtca aaggtgaaaa gattcttccg gtatttgatg 
601 agccaccaaa tccaaccaat gtagaagaga gtttgaagag aactaaagaa aacgatgctc 






661 atcttgttga agttaatttg aataatataa agaatatccc aattccaacc ctaaaagatt 
721 ttgcaaaggc tttggaaacc aacacacatg tgaaatgttt cagtcttgca gccacccgga 
781 gcaatgaccc tgttgctact gcttttgcag aaatgctgaa agtgaacaaa actttgaaga 
841 gcttaaatgt ggagtccaac tttatcacgg gagttgggat tctggcactg attgatgcgt 
901 taagagataa tgaaaccctg gcagagctca agattgacaa tcagaggcag cagttgggga 
961 cagctgtaga attggaaatg gccaagatgc ttgaggaaaa tacaaatatc cttaaatttg 
1021 gatatcagtt tacacagcag ggaccacgaa ccagagcagc taatgctata acaaaaaaca 
1081 atgacttagt gcgtaagaga cgagttgaag gagatcacca gtaagtctgc aaaggtgtaa 
1141 tctttggaag acttcagaag atcaccaagg gctcatgttg gtgacatcat gtaaaatttt 
1201 cctgggtaga agggaaaaga ctggaaaaat ttttttagtg acatgcattt tttttttagt 

1261 tgttatcaaa ttgtaaaatc agtaatgtga tattttatat tctgaaacat ttctactttc

1321 tgctaaaatc aattttaatt tagtttaatt gaatgattta tgatgaatct tgggcaaaaa 
1381 aatacaactg taaaaaattt cacaggtcat ttgtgtagaa taatttgaac attgtgagga 
1441 ccaatctttt ttaaatcaaa agggatgttg ctggtatcag aattgttatt gcttcattta 
1501 gacataaaac acttaagtgt tttccttcct ccgtgacctg gagagttttc cattttttaa 
1561 aatacatgaa cttggcaagg gtgtgatttt tctttatcaa gagaacaaaa tgccagaatg 
1621 ttaaaacagt taactcaaaa tctagtcatc tgtgtaggat aaagcaggtc tccctcccca 
1681 ccccccaaaa aaggcgtaca gccaactctt agttatttag ggtaagagaa aaaagaaaac 
1741 tgttaaacaa tgaaatcaaa gataataaat atgatagatt tccagtatga ggcactttgc 
1801 aaaacagtca tttaaactga tggttactgt aagtcagttg gaagctggca tgtatgtaaa 
1861 ttacttggtg ttacaccata tccgaagaag ttcctatgtg cagtgcagaa ttgcataaac 
1921 agtatttcaa tcactgctac aaatcttcca accaagaagg aaaactgcta ttcttcataa 
1981 atctaatttt tagtatatgc ttcctttcat cttatacttt tatcaatatt tataaaagtc 
2041 atttctataa taaaagcagt ctttcttgct ca 
D00099. Reports Homo sapiens mRNA...[gi:219941]



LOCUS HUMNKATPA 4108 bp mRNA linear PRI 11-MAR-1998
DEFINITION Homo sapiens mRNA for Na,K-ATPase alpha-subunit, complete cds.
ACCESSION D00099
VERSION D00099.1 GI:219941
KEYWORDS Na,K-ATPase; alpha-subunit.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4108)
AUTHORS Kawakami,K., Ohta,T., Nojima,H. and Nagano,K.
TITLE Primary structure of the alpha-subunit of human Na,K-ATPase deduced
from cDNA sequence
JOURNAL J. Biochem. 100 (2), 389-397 (1986)
PUBMED 2430951
COMMENT The alpha-subunit of the human Na,K-ATPase exhibited 87% homology
with its Torpedo counterpart and 98% homology with its sheep

FEATURES Location/Qualifiers
source 1..4108
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="pHNKA3, pHNKA21, and pHNKA54"
/cell_line="Hela cell"
/clone_lib="cDNA library in lambda-gt11, a gift from Dr.
P. Nielsen"
CDS 319..3390
/codon_start=1
/product="Na,K-ATPase alpha-subunit"
/protein_id="BAA00061.1"
/db_xref="GI:219942"

/translation="MGKGVGRDKYEPAAVSEQGDKKGKKGKKDRDMDELKKEVSMDDH 
KLSLDELHRKYGTDLSRGLTSARAAEILARDGPNALTPPPTTPEWIKFCRQLFGGFSM 
LLWIGAILCFLAYSIQAATEEEPQNDNLYLGVVLSAVVIITGCFSYYQEAKSSKIMES
FKNMVPQQALVIRNGEKMSINAEEVVVGDLVEVKGGDRIPADLRIISANGCKVDNSSL 
TGESEPQTRSPDFTNENPLETRNIAFFSTNCVEGTARGIVVYTGDRTVMGRIATLASG
LEGGQTPIAAEIEHFIHIITGVAVFLGVSFFILSLILEYTWLEAVIFLIGIIVANVPE 
GLLATVTVCLTLTAKRMARKNCLVKNLEAVETLGSTSTICSDKTGTLTQNRMTVAHMW 
FDNQIHEADTTENQSGVSFDKTSATWLALSRIAGLCNRAVFQANQENLPILKRAVAGD 
ASESALLKCIELCCGSVKEMRERYAKIVEIPFNSTNKYQLSIHKNPNTSEPQHLLVMK 
GAPERILDRCSSILLHGKEQPLDEELKDAFQNAYLELGGLGERVLGFCHLFLPDEQFP 
EGFQFDTDDVNFPIDNLCFVGLISMIDPPRAAVPDAVGKCRSAGIKVIMVTGDHPITA 
KAIAKGVGIISEGNETVEDIAARLNIPVSQVNPRDAKACVVHGSDLKDMTSEQLDDIL
KYHTEIVFARTSPQQKLIIVEGCQRQGAIVAVTGDGVNDSPALKKADIGVAMGIAGSD 
VSKQAADMILLDDNFASIVTGVEEGRLIFDNLKKSIAYTLTSNIPEITPFLIFIIANI 
PLPLGTVTILCIDLGTDMVPAISLAYEQAESDIMKRQPRNPKTDKLVNERLISMAYGQ 
IGMIQALGGFFTYFVILAENGFLPIHLLGLRVDWDDRWINDVEDSYGQQWTYEQRKIV 
EFTCHTAFFVSIVVVQWADLVICKTRRNSVFQQGMKNKILIFGLFEETALAAFLSYCP
GMGVALRMYPLKPTWWFCAFPYSLLIFVYDEVRKLIIRRRPGGWVEKETYY" 
polyA_signal 3696..3701

/note="putative" 

ORIGIN 

1 gaattcttaa gtctaaggga cttgctcctg atcttcctga agatctctac catttaatta 
61 agaaagcagt tgctgttcga aagcatcttg agaggaacgg acacgtggca acagcggtac 
121 gagcccgggc ggcggcagca acagcggcgg cggcatcggc ccgagccgcg gcccgctccc 
181 accctcccgc cccgcggcag ccctagctcc ctccacttgg ctcccctggt cccgctcgct 
241 cggccgggag ctgctctgtg cttttctctc tgattctcca gcgacaggac ccggcgccgg 

^01 gi-anl-ga gra rrrjr.r.arr.af g ggg^agggg gtfggar.gtg ataagtatga gcctgcagct ; 

361 gtttcagaac aaggtgataa aaagggcaaa aagggcaaaa aagacaggga catggatgaa 
421 ctgaagaaag aagtttctat ggatgatcat aaacttagcc ttgatgaact tcatcgtaaa 
481 tatggaacag acttgagccg gggattaaca tctgctcgtg cagctgagat cctggcgcga 
541 gatggtccca acgccctcac tccccctccc actactcctg aatggatcaa gttttgtcgg 
601 cagctctttg gggggttctc aatgttactg tggattggag cgattctttg tttcttggct 
661 tatagcatcc aagctgctac agaagaggaa cctcaaaacg ataatctgta cctgggtgtg 
721 gtgctatcag ccgttgtaat cataactggt tgcttctcct actatcaaga agctaaaagt 
781 tcaaagatca tggaatcctt caaaaacatg gtccctcagc aagcccttgt gattcgaaat 
841 ggtgagaaaa tgagcataaa tgcggaggaa gttgtggttg gggatctggt ggaagtaaaa 
901 ggaggagacc gaattcctgc tgacctcaga atcatatctg caaatggctg caaggtggat 
961 aactcctcgc tcactggtga atcagaaccc cagactaggt ctccagattt cacaaatgaa 
1021 aaccccctgg agacgaggaa cattgccttc ttttcaacaa attgtgttga aggcaccgca 
1081 cgtggtattg ttgtctacac tggggatcgc actgtgatgg gaagaattgc cacacttgct 
1141 tctgggctgg aaggaggcca gacccccatt gctgcagaaa ttgaacattt tatccacatc 
1201 atcacgggtg tggctgtgtt cctgggtgtg tctttcttca tcctttctct catccttgag 
1261 tacacctggc ttgaggctgt catcttcctc atcggtatca tcgtagccaa tgtgccggaa 
1321 ggtttgctgg ccactgtcac ggtctgtctg acacttactg ccaaacgcat ggcaaggaaa 
1381 aactgcttag tgaagaactt agaagctgtg gagaccttgg ggtccacgtc caccatctgc 
1441 tctgataaaa ctggaactct gactcagaac cggatgacag tggcccacat gtggtttgac 
1501 aatcaaatcc atgaagctga tacgacagag aatcagagtg gtgtctcttt tgacaagact 
1561 tcagctacct ggcttgctct gtccagaatt gcaggtcttt gtaacagggc agtgtttcag 
1621 gctaaccagg aaaacctacc tattcttaag cgggcagttg caggagatgc ctctgagtca 
1681 gcactcttaa agtgcataga gctgtgctgt ggttccgtga aggagatgag agaaagatac 
1741 gccaaaatcg tcgagatacc cttcaactcc accaacaagt accagttgtc tattcataag 
1801 aaccccaaca catcggagcc ccaacacctg ttggtgatga agggcgcccc agaaaggatc 
1861 ctagaccgtt gcagctctat cctcctccac ggcaaggagc agcccctgga tgaggagctg
1921 aaagacgcct ttcagaacgc ctatttggag ctggggggcc tcggagaacg agtcctaggt 
1981 ttctgccacc tctttctgcc agatgaacag tttcctgaag ggttccagtt tgacactgac 
2041 gatgtgaatt tccctatcga taatctgtgc tttgttgggc tcatctccat gattgaccct 
2101 ccacgggcgg ccgttcctga tgccgtgggc aaatgtcgaa gtgctggaat taaggtcatc 
2161 atggtcacag gagaccatcc aatcacagct aaagctattg ccaaaggtgt gggcatcatc 
2221 tcagaaggca atgagaccgt ggaagacatt gctgcccgcc tcaacatccc agtcagccag 
2281 gtgaacccca gggatgccaa ggcctgcgta gtacacggca gtgatctaaa ggacatgacc 
2341 tccgagcagc tggatgacat tttgaagtac cacactgaga tagtgtttgc caggacctcc 
2401 cctcagcaga agctcatcat tgtggaaggc tgccaaagac agggtgctat cgtggctgtg 
2461 actggtgacg gtgtgaatga ctctccagct ttgaagaaag cagacattgg ggttgctatg
2521 gggattgctg gctcagatgt gtccaagcaa gctgctgaca tgattcttct ggatgacaac 
2581 tttgcctcaa ttgtgactgg agtagaggaa ggtcgtctga tctttgataa cttgaagaaa 
2641 tccattgctt ataccttaac cagtaacatt cccgagatca ccccgttcct gatatttatt 
2701 attgcaaaca ttccactacc actggggact gtcaccatcc tctgcattga cttgggcact 
2761 gacatggttc ctgccatctc cctggcttat gagcaggctg agagtgacat catgaagaga
2821 cagcccagaa atcccaaaac agacaaactt gtgaatgagc ggctgatcag catggcctat 
2881 gggcagattg gaatgatcca ggccctggga ggcttcttta cttactttgt gattctggct 
2941 gagaacggct tcctcccaat tcacctgttg ggcctccgag tggactggga tgaccgctgg 
3001 atcaacgatg tggaagacag ctacgggcag cagtggacct atgagcagag gaaaatcgtg 
3061 gagttcacct gccacacagc cttcttcgtc agtatcgtgg tggtgcagtg ggccgacttg 
3121 gtcatctgta agaccaggag gaattcggtc ttccagcagg ggatgaagaa caagatcttg 
3181 atatttggcc tctttgaaga gacagccctg gctgctttcc tttcctactg ccctggaatg 
3241 ggtgttgctc ttaggatgta tcccctcaaa cctacctggt ggttctgtgc cttcccctac 
3301 tctcttctca tcttcgtata tgacgaagtc agaaaactca tcatcaggcg acgccctggc 
3361 ggctgggtgg agaaggaaac ctactattag ccccccgtcc tgcacgccgt ggagcatcag 
3421 gccacacact ctgcatccga cacccacccc ctctttgtgt acttcagtct tggagtttgg 
3481 aactctaccc tggtaggaaa gcaccgcagc atgtggggaa gcaagacgtc ctggaatgaa 
3541 gcatgtagct ctatgggggg aggggggagg gctgcctgaa aaccatccat ctgtggaaat 
3601 gacagcgggg aaggttttta tgtgcctttt tgtttttgta aaaaaggaac acccggaaag 
3661 actgaaagaa tacattttat atctggattt ttacaaataa agatggcaga aactgacttt

3721 attttaaggg ttccagcatc ccagagccaa aaacaaatag tgccatctgc cccagtagaa 

3781 gatagatatc tctttgagcc actgcacaat ggtgagaact gtagtgatgt aatagatgca 

3841 ctatggccct gaagaacagc caaaggtgca caggttcgaa gacaccagac tcggatcatt 

3901 ttatcacaac ttccagctgc tatcatggta ttctcatagt ttacagccat gtctgatatt 

3961 tcagcagcat gtcctcttaa ggtagctaac aacctcccat catctgttgc ccatattttc 

4021 acaagacagt catcagaacc agtaaatatc cgtctgccag ttcgatcaaa agttacacag
4081 tacacagatg acaagtgtcc aagaattc 
1: BC002795. Reports Homo sapiens plas...[gi:33877195]

LOCUS BC002795 2409 bp mRNA linear PRI 19-MAR-2007 

DEFINITION Homo sapiens plasminogen activator, tissue, mRNA (cDNA clone
MGC:3677 IMAGE:3618149), complete cds.
ACCESSION BC002795
VERSION BC002795.2 GI:33877195
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 

REFERENCE 1 (bases 1 to 2409) 

AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team 

TITLE Generation and initial analysis of more than 15,000 full-length 

human and mouse cDNA sequences 
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) 
PUBMED 12477932 
REFERENCE 2 (bases 1 to 2409) 
CONSRTM NIH MGC Project 
TITLE Direct Submission 

JOURNAL Submitted (05-FEB-2001) National Institutes of Health, Mammalian 

Gene Collection (MGC), Bethesda, MD 20892-2590, USA 
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 19, 2003 this sequence version replaced gi:12803896.

Contact: MGC help desk 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: ATCC/DCTD/DTP 

cDNA Library Preparation: Rubin Laboratory 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: National Institutes of Health Intramural 

Sequencing Center (NISC), 

Gaithersburg, Maryland; 

Web site: http://www.nisc.nih.gov/ 

Contact: nisc_mgc@nhgri.nih.gov

Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.

Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 12 Row: p Column: 15

This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi:14702165.

Differences found between this sequence and the human reference
genome (build 36) are described in misc_difference features below.

FEATURES Location/Qualifiers
source 1..2409
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:3677 IMAGE:3618149"
/tissue_type="Skin, melanotic melanoma."
/clone_lib="NIH_MGC_20"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2409
/gene="PLAT"
/gene_synonym="TPA"
/gene_synonym="T-PA"
/db_xref="GeneID:5327"
/db_xref="HGNC:9051"
/db_xref="MIM:173370"
CDS 84..1634
/gene="PLAT"
/gene_synonym="TPA"
/gene_synonym="T-PA"
/codon_start=1
/product="plasminogen activator, tissue"
/protein_id="AAH02795.3"
/db_xref="GI:127796397"
/db_xref="GeneID:5327"
/db_xref="HGNC:9051"
/db_xref="MIM:173370"

/translation="MDAMKRGLCCVLLLCGAVFVSPSQEIHARFRRGARSYQGCSEPR 
CFNGGTCQQALYFSDFVCQCPEGFAGKCCEIDTRATCYEDQGISYRGTWSTAESGAEC
TNWNSSALAQKPYSGRRPDAIRLGLGNHNYCRNPDRDSKPWCYVFKAGKYSSEFCSTP
ACSEGNSDCYFGNGSAYRGTHSLTESGASCLPWNSMILIGKVYTAQNPSAQALGLGKH
NYCRNPDGDAKPWCHVLKNRRLTWEYCDVPSCSTCGLRQYSQPQFRIKGGLFADIASH 
PWQAAIFAKHRRSPGERFLCGGILISSCWILSAAHCFQERFPPHHLTVILGRTYRVVP 
GEEEQKFEVEKYIVHKEFDDDTYDNDIALLQLKSDSSRCAQESSVVRTVCLPPADLQL 
PDWTECELSGYGKHEALSPFYSERLKEAHVRLYPSSRCTSQHLLNRTVTDNMLCAGDT 
RSGGPQANLHDACQGDSGGPLVCLNDGRMTLVGIISWGLGCGQKDVPGVYTKVTNYLD 
WIRDNMRP" 



misc_difference 2395..2409
/gene="PLAT"
/gene_synonym="TPA"
/gene_synonym="T-PA"
/note="polyA tail: 15 bases do not align to the human
genome."



1 caggagtcca gggctggaga gaaaacctct gcgaggaaag ggaaggagca agccgtgaat 
61 ttaagggacg ctgtgaagca atcatggatg caatgaagag agggctctgc tgtgtgctgc 
121 tgctgtgtgg agcagtcttc gtttcgccca gccaggaaat ccatgcccga ttcagaagag
181 gagccagatc ttaccaaggt tgcagcgagc caaggtgttt caacgggggc acctgccagc
241 aggccctgta cttctcagat ttcgtgtgcc agtgccccga aggatttgct gggaagtgct
301 gtgaaataga taccagggcc acgtgctacg aggaccaggg catcagctac aggggcacgt
361 ggagcacagc ggagagtggc gccgagtgca ccaactggaa cagcagcgcg ttggcccaga
421 agccctacag cgggcggagg ccagacgcca tcaggctggg cctggggaac cacaactact
481 gcagaaaccc agatcgagac tcaaagccct ggtgctacgt ctttaaggcg gggaagtaca
541 gctcagagtt ctgcagcacc cctgcctgct ctgagggaaa cagtgactgc tactttggga
601 atgggtcagc ctaccgtggc acgcacagcc tcaccgagtc gggtgcctcc tgcctcccgt
661 ggaattccat gatcctgata ggcaaggttt acacagcaca gaaccccagt gcccaggcac 
721 tgggcctggg caaacataat tactgccgga atcctgatgg ggatgccaag ccctggtgcc 
781 acgtgctgaa gaaccgcagg ctgacgtggg agtactgtga tgtgccctcc tgctccacct 
841 gcggcctgag acagtacagc cagcctcagt ttcgcatcaa aggagggctc ttcgccgaca 
901 tcgcctccca cccctggcag gctgccatct ttgccaagca caggaggtcg cccggagagc 
961 ggttcctgtg cgggggcata ctcatcagct cctgctggat tctctctgcc gcccactgct 
1021 tccaggagag gtttccgccc caccacctga cggtgatctt gggcagaaca taccgggtgg 
1081 tccctggcga ggaggagcag aaatttgaag tcgaaaaata cattgtccat aaggaattcg 
1141 atgatgacac ttacgacaat gacattgcgc tgctgcagct gaaatcggat tcgtcccgct 
1201 gtgcccagga gagcagcgtg gtccgcactg tgtgccttcc cccggcggac ctgcagctgc 
1261 cggactggac ggagtgtgag ctctccggct acggcaagca tgaggccttg tctcctttct
1321 attcggagcg gctgaaggag gctcatgtca gactgtaccc atccagccgc tgcacatcac 
1381 aacatttact taacagaaca gtcaccgaca acatgctgtg tgctggagac actcggagcg 
1441 gcgggcccca ggcaaacttg cacgacgcct gccagggcga ttcgggaggc cccctggtgt 
1501 gtctgaacga tggccgcatg actttggtgg gcatcatcag ctggggcctg ggctgtggac 
1561 agaaggatgt cccgggtgtg tacaccaagg ttaccaacta cctagactgg attcgtgaca 
1621 acatgcgacc gtgaccagga acacccgact cctcaaaagc aaatgagatc ccgcctcttc 
1681 ttcttcagaa gacactgcaa aggcgcagtg cttctctaca gacttctcca gacccaccac 
1741 accgcagaag cgggacgaga ccctacagga gagggaagag tgcattttcc cagatacttc 
1801 ccattttgga agttttcagg acttggtctg atttcaggat actctgtcag atgggaagac 
1861 atgaatgcac actagcctct ccaggaatgc ctcctccctg ggcagaaagt ggccatgcca 
1921 ccctgttttc agctaaagcc caacctcctg acctgtcacc gtgagcagct ttggaaacag 
1981 gaccacaaaa atgaaagcat gtctcaatag taaaagataa caagatcttt caggaaagac 
2041 ggattgcatt agaaatagac agtatattta tagtcacaag agcccagcag ggcctcaaag 
2101 ttggggcagg ctggctggcc cgtcatgttc ctcaaaagca cccttgacgt caagtctcct 
2161 tcccctttcc ccactccctg gctctcagaa ggtattcctt ttgtgtacag tgtgtaaagt 
2221 gtaaatcctt tttctttata aactttagag tagcatgaga gaattgtatc atttgaacaa 
2281 ctaggcttca gcatatttat agcaatccat gttagttttt actttctgtt gccacaaccc 
2341 tgttttatac tgtacttaat aaattcagat atatttttca cagtttttcc aaaaaaaaaa 
2401 aaaaaaaaa 